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Abstract — In modern digital systems large capacity and data 
transfer rate is required. Synchronous DRAM (SDRAM) 
became the memory of choice due to its speed, burst access 
and pipeline features. A Controller is required to provide proper 
commands for SDRAM initialization, read/write accesses and 
memory refresh. In semiconductor memories there are 
chances of errors. To ensure reliable data storage, an error 
correction and detection scheme is required. This paper 
describes the architecture design and characterization of 
SDRAM Controller IP core with built in Error Correcting 
Codes (ECC) module which is vendor neutral. The design is 
described using Verilog HDL, simulated using ModelSim 
and prototyped in Altera® platform FPGA. Resource 
utilization and power analysis was done using Altera® Quartus 
II. Hardware test results are obtained from Signal Tap Logic 
Analyzer. 

Index Terms — Synchronous DRAM, Error Correcting Codes, 
IP core, Power analysis, Resource utilization. 

I. Introduction 

Synchronous DRAMs (SDRAMs) become the memory 
of choice in many digital systems because it provides a 
significant improvement in bandwidth performance over 
traditional asynchronous DRAMs such as "FPM" (Fast Page 
Mode) and "EDO" (Extended Data Out). In Synchronous 
DRAMs input address, data, and control signals are typically 
latched on the positive edge of the clock signal. SDRAMs 
offer several features such as multiple internal banks, burst 
mode access, and pipelining of operation executions, that 
helps to improve bandwidth performance. 

There are two popular types of SDRAM in market. The 
most common single data rate (SDR) SDRAM transfers data 
typically on the rising edge of the clock. The other is the 
double data rate (DDR) SDRAM [5] which transfers data on 
both the rising and falling edge to double the data transfer 
throughput. Other than the data transfer phase, the different 
power-on initialization and mode register definitions; these 
two SDRAMs share the same command set and basic design 
concepts. This paper describes a design that is targeted for 
SDR SDRAM. However, due to the similarity of SDR and 
DDR SDRAM, this design can also be adapted for a DDR 
SDRAM controller. For benchmarking purpose, the Micron® 
SDR SDRAM MT48LC8M16A2 is chosen as a target for this 
design. In semiconductor memories there are chances of 
errors. Therefore an error correction and detection scheme 
can be used to provide reliable data storage. Also, this design 
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has been verified by using memory simulation model. 

The section II of this paper is a tutorial review of 
Synchronous DRAMs. The section III describes the design 
of the proposed SDRAM Controller with ECC module. The 
section IV describes the simulation results. The section V 
describes the implementation results of the proposed 
controller. 

II. Synchronous Dram Review 

SDRAM, or Synchronous Dynamic Random Access 
Memory is a form of semiconductor memory that can run at 
faster speeds than conventional DRAM. Since SDRAM has 
a synchronous interface, it has an internal finite state machine 
that pipelines incoming instructions. Thus the speed of 
operation is much higher. 

The Micron® 128Mb [4] SDRAM referenced in this paper 
is a high-speed CMOS, dynamic random access memory 
containing 134,217,728 bits. It is internally configured as a 
quad-bank DRAM with a synchronous interface. Each of the 
33,554,432-bit banks is organized as 4,096 rows by 512 
columns by 16 bits. The 128Mb SDRAM is designed to 
operate in 3.3V memory systems. All inputs and outputs are 
LVTTL-compatible. 

III. Proposed Controller Design 

A. Introduction 

The proposed SDRAM Controller is designed to work 
with a standard memory from Micron Technology® with series 
MT48LC8M16A2™. It has a user interface end on one side 
and 128Mb SDRAM on the other end. The design is coded 
in Verilog HDL. The controller offers facility for programmable 
burst lengths of 1, 2, 4, and 8, programmable CAS latency of 
2 and 3. 

Initialization should be done before applying any normal 
operation. Read and Write should be performed only after 
the initialization. The proposed controller design 
automatically performs all the initialization procedures [1,2, 
4]. ECC block consists of encoder and decoder-corrector, 
which can detect and correct single-bit errors and detect 
double -bit errors. 

B. Block Diagram 

The block diagram of proposed SDRAM Controller is 
shown in figure 1 . The main function of the controller is to 
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convert user commands to commands that can be 
understandable by the memory [6]. Another is to detect and 
correct single-bit errors and detect double-bit errors[3]. The 
main blocks include datapath, finite state machine, 
initialization control block and ECC block. 



Table I. Pin Description 
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Figure 1. Controller block diagram 

The inputs like command (cmd), reset are collectively 
shown as control input. The memory address is given through 
the address input. The data during write operation is given 
through the data input. Data during read operation is obtained 
through data output. Error status is also obtained as an 
output. 

C. Pin Description 

The input output diagram of controller is shown in figure 
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Figure 2. Pin out diagram of controller 

Type and the function of all the pins are described in 
table I. 

D. Detailed Architecture 

The detailed architecture of the proposed controller is 
shown in fig 3. 

E. Initialization Control State Machine 

According to architecture and working principle of 
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PIN 


TYPE 


DESCRIPTION 


sys Llk 


Input 


Input dock to the controller. Used to 
generate the dock (dk) to the memory. 


Reset 


Input 


System reset - L sed to reset the total system. 


Cmd 

mi 


Input 


Command input which distinguishes between 
different operations like P'recharge. refresh, 
ioad mode register, active, read and write. 


Data in 
[10*] 


Input 


11 -bit data input to the controller which ts 
used as data during write operation. 


Data out 
[10*] 


Output 


11-bit data output from the controller during 
read operation. 


Addr 
[230] 


Input 


2 3 -bit Address bus. From this 2 5 -bit address 
row address, column address and bank 
address is decoded. 


Dm [ID] 


Input 


Dm is used to mask input data during write 
cvcle and output data during read cyde. 


Cfk 


Output 


CLK is driven by the system dock Typical 

1 1 fLfil ITT 1 1 1^1 ITT 

i-alues are lOOMHzand UjMKz. 


Saddr 
[11*] 




Address input AO-All are sampled during 
the ACTIVE command (row address AO- 
All) and READ 'WRITE command 
(column-address A0-AS). 


Ba[l*] 


Output 


Bank address is used to select the bank 


Cks 


Output 


CKE activates (HIGH) and deactivates 
(LOW) the CLK signal. 


Csa 


Output 


CS enables (LOW) and disables (HIGH) the 
coiumand decoder. All commands are 
masked when CS is registered HIGH. 


Rasn 


Output 


Command inputs WE. CAS. and RAS (along 
with CS) define the command being entered. 


Casn 


Output 


Wis 


Output 


Error 
[1*] 


Output 


Error is an output signal used to display the 
error status. If it is 'W'\ no error has 
occurred. If it is : *)C" : single-bit error has 
occurred and corrected. 'A it is : '01". double- 
tit error has occurred and not corrected. 
Value "IF is not possible 


Dqm 
[1*] 


Output 


DQM is an output mask signal for write 
accesses and an output enable signal for read 
accesses. Input data to the memory is masked 
when DQM is sampled HIGH during a 
WRITE cyde. The output buffers are placed 
in a Higi-Z state (2-dock latency') when 
DQM is sampled HIGH during a READ 
cyde. DQM is generated from Dm signal. 


Dq [15:0] 


Inout 


Bidirecticnd data bus which is connected to 
the bidirectional data bus of memory. During 
write operation Dq is directed from controller 
to the memory. During read operation Dq is 
directed from memory to the controller 



SDRAM [4], the controller design uses two Finite State 
Machines to implement timing-logic control [1,2]. The 
SDRAM must be powered up and initialized in a predefined 
manner before any normal operation. Automatic initialization 
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Figure 3. Architecture of the controller 



is carried out by this state machine. The main processes in 
initialization is shown in figure 4. 
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Figure 4. Initialization process flow 

Initialization process consist of sequence of operations. 
First Precharge all banks(PRE) should be done. Precharging 
is required to deactivate all banks and put them in idle state. 
After precharging wait for ^(150 ns) period and execute 
Auto Refresh (REF). After autorefresh wait for t RFC (495 ns) 
period and again apply Auto Refresh. Again wait for t 
period and then Load Mode Register(LMR) command should 
be issued. With this command some special attributes are set 
(i.e. burst packet length , access type, CAS latency and 
other). Then wait for t,^ (15 ns) period. After the initialization, 
the SDRAM goes into the idle state, which is the initial state 
of the main FSM which is explained below. 

F. Main Finite State Machine 

Another state machine is the main FSM or the core of the 
controller which is shown in figure 5. The FSM generates 
control signals corresponding to the state. Transition 
between different states is according to the command input. 

The controller awakens in the IDLE state and then 
changes to Precharge All, Precharge Selected, Load Mode 
Register, AutoRefresh, or Active depending on the system 
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Figure 5. Main FSM 

command. The dashed lines in the state machine diagram 
indicate the automatic transfer. For Read and Write, the 
controller first goes into Active state. During active state 
row address and bank address is decoded from the address 
input. Controller enters write state when command is 1 10 and 
read state when command is 1 1 1 . 

G. Datapath Circuit 

The data flow design between the SDRAM and the system 
interface is shown in figure 6. The datapath circuit consists 
of four number of 16-bit D flip flops and a tristate buffer. Flip 
flop is used to shift the data. The data passing through the 
datapath circuit is either the codeword from the ECC Encoder 
or codeword from the memory. Tristate buffer is used to 
determine the direction of bidirectional data bus. Tri state 
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buffer has a control input "oe" which determines whether it 
is a read or write operation. It is obtained as an output from 
the main finite state machine. 



D=t; in- 
[15*] 



Dsta out 
[15:0]*" 



D Q 
> 




D Q 
> 







H> 



Q D 

< 






< 







DQ 
[15*] 



Figure 6. Datapath circuit of the controller 

H. Error Correcting Codes (ECC) block 

Hamming Codes can be used to detect and correct single- 
bit errors and detect double-bit errors. It is relatively simple 
yet powerful ECC code. It involves transmitting data with 
multiple check bits(parity) and decoding the associated check 
bits when receiving data to detect errors. ECC block comprises 
an encoder and a decoder-corrector which can detect and 
correct single-bit errors and detect double-bit errors. 

The number of parity bits, m, needed to detect and correct 
a single bit error in a data string of length n is given by the 
following equation(l) [3]. 



m = log n +1 



(1) 



The ECC block uses the Hamming code with an additional 
parity bit, which can detect single and double-bit errors, and 
correct single-bit errors [3, 7]. The extra parity bit applies to all 
bits after the Hamming code check bits have been added. 
This extra parity bit represents the parity of the codeword. If 
one error occurs, the parity changes, if two errors occur, the 
parity stays the same. In general the number of parity bits, m, 
needed to detect a double-bit error or detect and correct a 
single-bit error in a data string of length n, is given by the 
following equation(2) [3]. 



m = log n +2 



(2) 



The target memory is 16 bit wide. Therefore sum of data 
bits and parity bits should not exceed 16 bits. 16-bit codeword 
can provide error correction and detection to 1 1-bit data. For 
an 11 -bit message there are 11 possible single-bit errors. 
Therefore hamming code (16,11) can be used in this case. 
(16,1 1) hamming code means 1 1 bit data, 5 parity bits and 16 
bit codeword. Hamming codeword is a concatenation of the 
original data and the parity bits. 

ECC block comprises ECC encoder and ECC decoder- 
corrector. Encoder converts the 11 -bit input message into 16- 
bit codeword by placing parity bits in power-of-two positions. 
This 16-bit codeword is stored in the memory. Decoder- 
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corrector detects any error in the received codeword by 
calculating syndrome bits and corrects single-bit errors by 
using a mask. 

ECC Encoder 

The encoder takes the input data and encodes the 
message into a (1 1 + 5) bit codeword. It uses the principle of 
Hamming Code with an additional parity bit. The parity bits 
can be calculated according to the equations given in table 
II. The parity bit P[0] is generated by XORing the data bits 
that have "1" at position "k" in the address field xxxk [3,8]. 
Similarly P[l], P[2] and P[3] are also calculated. P[4] is 
generated by XORing all the data bits and previously 
calculated parity bits. 

Table II. Parity Bits Calculation 
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Paritv- 
P[4*] 


Equation [Input Data -d[10:0]] 


P[0] 


d[0] A d[l] A d[3] A d[4] A d[6] A d[S] A d[10] 


m 


d[0] A d[2] A d[3] A d[5] A d[fJ] A d[9] A d[10] 


PR] 


d[l] A dRj A dp] A CO A d[S] A m A d[10] 


m 


d[4] n d[5] A d[o] n df7] A d[S] ''' d[9] A d[10] 


P[4] 


d[0] A d[l] '"- d[2] A d[3] n d[4] A dp] A d|S] A dfT| "d[S] A 
d[9] A d[10] '"■ F{3] A PP] A P[l] A P[0] 



The parity bits are placed in power-of-two positions 
(1,2,4,8). The position of parity bits and information bits are 
shown in figure 7. 



P[4] 


P[0] 


P[l] 


d[0] 


m 


d[l] 


d[2] 


d[3] 





1 


2 


3 


4 


5 


6 


7 


P[J] 


d[4] 


m 


m 


d[7] 


d[S] 


m 


d[10] 


S 

" 


9 


10 




u 

i i 


12 

i 


13 


14 


15 



Figure 7. Codeword (16-bit) 

ECC Decoder-Corrector 

The decoder-corrector is a self-contained entity for mes- 
sages of 1 1 bit. The decoder creates a syndrome and sends 
it with the received message to the corrector. The corrector 
extracts the syndrome bits and analyse it. By analysing the 
value of syndrome bits, the corrector detects single-bit and 
double -bit errors and corrects the single bit errors of the 
received message according to the syndrome. The process 
flow diagram of decoder-corrector block[7] is shown in fig- 
ure 8. 

In the syndrome computation [3] step, decoder computes 
five syndromes using the 16-bit codeword received from the 
memory. The syndromes S[0] to S[4] are computed by XORing 
the encoder parity bits P[0] to P[4] and decoder parity check 
bits C[0] to C[4]. Encoder parity bits are obtained from the 
power-of-two positions in the codeword received. Decoder 
parity check bits are obtained by calculating the parity of the 
bits in the received codeword. Equation for syndrome is given 
below. 
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Figure 8. Decoder process flow 

Syndrome <= check A parity (3) 

The corrector module is responsible for error diagnosis 
and error correction. Principle of error correction and error 
diagnosis is explained below. 

Error Diagnosis 

The ECC block can diagnose if a one- or two-bit error has 
occurred on the data from memory. To indicate error status a 
2-bit error signal is provided. Errorfl ] indicate single bit error 
and error[0] indicate double bit error. Diagnosis table is given 
in table HI. 

Table III. Diagnosis Table 



Errcr[l] 


Error[0] 


Diagnosis 








There is no error on the mes sage on the output. 


1 





There was Dne error on the c odeword the 
message is equivalent to the original. 


D 


1 


There are two errors on the codeword no 
correction hav e b een made. 


1 


1 


Not possible. 



Error diagnosis is done by examining the syndrome bits. 
If the S[4] of received syndrome is '0', the corrector checks 
whether the bits S[3:0] is '0000' or not. If it is '0000', it is 
inferred that no error has occurred. If bits S[3:0] is not '0000' , 
it is inferred that a double bit error has occurred. If the S[4] is 
' 1 ' , then it is inferred that a single bit error has been occurred. 

Mask Generation and Error Correction 

Mask is used to correct the message. 11 -bit mask is 
generated from the value of syndrome. The value of syndrome 
indicates the position of erroneous bit. The corresponding 
bit position of the mask is set high. The corrected message is 
calculated by XORing the mask and the decoded data from 
the decoder. 

Corrected output data <= mask A decoded data (4) 

IV. Simulation Result 

The Controller along with memory can be simulated using 
Modelsim software. The functional simulation result of the 
wrapper module is shown in Fig. 9. Two clock signals are 
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shown sys_clk and elk. Former is the clock input to the 
controller. Latter is the clock input to the memory. It is 
generated by the controller. The designed IP core can be 
further prototyped using Altera DE2 FPGAboard[9]. 



fevii DtoJop.«c/ck 

^c«*olfi_it(>_*:c/d«to_»i 

/contdtr _Vp_«c/ dq 
■:.>.■•:■•>■ • < _ ijv. 
■':■>'■■ -± '■ -ii 

fan 'j iAji_Uw_Kr/«_fl 




W 

IICOIIOOIIIIKIMO 
M 
DO 

sn 
so 
sn 
so 
5ti 




Figure 9. Simulation result 

Note on simulation result : 

Reset : (sys_clk= I'M, clk= l'bl, reset = l'bl, cmd = 3'bxxx, 
dm = 2'bxx, data_in = 1 l'bxxxxxxxxxxx, addr = 
23'bxxxxxxxxxxxxxxxxxxxxxxx, data_out =11 'bzzzzzzzzzzz, error 
= 2'b00) 

Initialization: (sys_clk= l'bl,clk= l'bl,reset= 1'bO, cmd = 
3'bxxx, dm = 2'bxx, data_in = 1 1 'bxxxxxxxxxxx, 
addr=23'b000000000000000000 10001, data_out = ll'b 
00000000000, error = 2'b00) 

Activate : (sys_clk = l'bl, elk = l'bl, reset = 1'bO, cmd = 
3'bl01, dm = 2'b00, data_in = 1 l'bl 1001 1001 10, addr = 
23'bOOOOOOOOOOOOOOOOOOOOOOO, data_out = 1 l'bOOOOOOOOOOO, 
error =2'b00) 

Write :(sys_clk= l'bl,clk= l'bl,reset= 1'bO, cmd = 3'bll0, 
dm = 2'b00, data_in = 1 l'bl 10011001 10, addr = 
23'b00000000000000000000000, data_out = 1 l'bOOOOOOOOOOO, 
error =2'b00) 

Idle: (sys_clk= l'bl, elk = l'bl, reset = 1'bO, cmd = 3'b000, 
dm = 2'b00, data_in = 1 l'bxxxxxxxxxxx, addr = 
23'bxxxxxxxxxxxxxxxxxxxxxxx,data_out= 1 rbOOOOOOOOOOO, error 
= 2'b00) 

Activate : (sys_clk = l'bl, elk = l'bl, reset = 1'bO, cmd = 
3'bl01, dm = 2'b00, data_in = 1 l'bxxxxxxxxxxx, addr = 
23'b00000000000000000000000, data_out = 1 l'bOOOOOOOOOOO, 
error =2'b00) 

Read : (sys_clk= l'bl, clk= l'bl, reset = 1'bO, cmd = 3'blll, 
dm = 2'b00, data_in = 1 l'bxxxxxxxxxxx, addr = 
23'b00000000000000000000000, data_out = 1 l'bl 1001 1001 10, 
error =2'b00) 

V. Implementation Results 

The controller module is tested by using Altera® DE2 
FPGA board. Sys_clk is taken from the board itself. Reset is 
given as trigger input. Altera Quartus® II is used to synthe- 
sis the design. Signal Tap® II Logic Analyzer is used to take 
hardware test result from FPGA. Resource utilization and 
Power analysis result is listed in table IV and table V respec- 
tively. 
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Figure 10. Hardware test result 
Table IV. Resource Utilization Table 





USAGE 


Estimated Total loeic ?l?m?nts 


l : 376 


Total c ombinational functions 


696 


L d Eic - 1 - ti 'i - p t u saE? b v rjurjQ b er o f LUT 
inputs 




- - 4 input functions 


367 


— 3 input functions 


173 


— 2 input functions 


15* 


Logic. ?l?m?nts by mod? 




— normal mod? 


5:5 


- - arithmetic mod? 


40 


Total registers 


1116 


■ - Dedicated lo gee registers 


1116 


— L'O registers 





I/O pins 


52 


Total mamDry bits 


64:: 


Maximum fan- out nod? 


sys_clk 


Maximum fan- out 


731 


T otal fan-out 


41403 


Averag? tan- out 


3.34 



Resource Utilization table summarizes usage statistics for 
resources including logic elements, registers, I/O pins, 
memory blocks, interconnect usage, and fan-out. 

Table V. Power Analysis Table 



PARAMETER 


RESULT 


Total Th?nnal Power Dissipation 


115.87 mW 


Cor? Dynamic Thannal Power Dis sipation 


0.00 nrtV 


Cor? Static Thermal Power Dis sipation 


79.94 mW 


I/O Thermal Power Dis sipation 


35.92 mW 
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Power Analysis table gives details about the core dynamic 
thermal power dissipation, core static thermal power 
dissipation, I/O thermal power dissipation and total thermal 
power dissipation. 

Conclusion 

This paper describes the the design, simulation and 
characterization of synthesizable SDRAM Controller IP core 
with built in ECC module. The design was developed using 
Verilog HDL. It can be easily modified for different system 
design requirements. The proposed design has been tested 
by implementing the design on Altera DE2 board which uses 
Cyclone-II device. 
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